Interpretable Counterfactual Explanations Guided by Prototypes
نویسندگان
چکیده
We propose a fast, model agnostic method for finding interpretable counterfactual explanations of classifier predictions by using class prototypes. show that prototypes, obtained either an encoder or through specific k-d trees, significantly speed up the search instances and result in more explanations. quantitatively evaluate interpretability generated counterfactuals to illustrate effectiveness our on image tabular dataset, respectively MNIST Breast Cancer Wisconsin (Diagnostic). Additionally, we principled approach handle categorical variables Adult (Census) dataset. Our also eliminates computational bottleneck arises because numerical gradient evaluation black box models.
منابع مشابه
Explanations of Counterfactual Inferences
When engaging in counterfactual thought, people must imagine changes to the actual state of the world. In this study, we investigated how people reason about counterfactual scenarios by asking participants to make counterfactual inferences about a series of causal devices (i.e., answer questions such as If component X had not operated [had failed], would components Y, Z, and W have operated?) a...
متن کاملCausal Explanations in Counterfactual Reasoning
This paper explores the role of causal explanations in evaluating counterfactual conditionals. In reasoning about what would have been the case if A had been true, the localist injunction to hold constant all the variables that causally influence whether A is true or not, is sometimes unreasonably constraining. We hypothesize that speakers may resolve this tension by including in their delibera...
متن کاملMAGIX: Model Agnostic Globally Interpretable Explanations
Explaining the behavior of a black box machine learning model at the instance level is useful for building trust. However, what is also important is understanding how the model behaves globally. Such an understanding provides insight into both the data on which the model was trained and the generalization power of the rules it learned. We present here an approach that learns rules to explain gl...
متن کاملInterpretable and Informative Explanations of Outcomes
In this paper, we solve the following data summarization problem: given a multi-dimensional data set augmented with a binary attribute, how can we construct an interpretable and informative summary of the factors affecting the binary attribute in terms of the combinations of values of the dimension attributes? We refer to such summaries as explanation tables. We show the hardness of constructin...
متن کاملusing counterfactual analysis for providing historical explanations in social sciences
counterfactual analysis is concerned with explaining events that have not happened. counterfactuals are mental experiments through which one can reconstruct hypothetical versions of the history in one’s mind; these versions are relatively different from the real history, but provide one with the opportunity to test historical hypotheses against the available evidence. historicist researchers in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-86520-7_40